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Abstract 

This paper provides a necessary and sufficient condition for detectability, and an explicit 
construction of observers when this condition is satisfied, for chemical reaction networks of 
the Feinberg-Horn-Jackson zero deficiency type. 
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1 Introduction 

One of the most interesting questions in control theory is that of constructing observers. Ob- 
servers compute estimates of the internal states of a dynamical system, using data provided 
by measurement probes or partial state information. For linear systems, Luenberger observers 
(also known as "deterministic Kalman filters" since they amount to Kalman filters designed 
without regard to the statistics of measurement noise) provide a general solution, but, for non- 
linear systems, establishing generally applicable conditions for existence and convergence of 
observers is an open and active area of research. 

The question of constructing observers has long been of interest in chemical engineering, and 
in particular for bioreactors; see for instance the papers ||, [l5|] and the book Q. Observers 
are of great potential interest, in particular, for biological experimentation and biomedical 
applications, where the online monitoring of proteins involved in signaling pathways (using 
for instance fluorescent labelings of molecules) will lead to a better understanding of cellular 
dynamical processes. 

The main objective of this paper is to construct (when and if possible) state observers for the 
class of systems which describe chemical reaction networks of the type introduced by Feinberg, 
Horn, and Jackson in ||, ^, [8], ||, |ll], |TJ]. As outputs, we take a subset of state variables 
or, more generally, monomials in state variables (which could, in practice, be associated to 
measured reaction rates) . We provide here a complete solution to the observer problem for this 
class. The results given in |2(J provide a convenient formalism as well as a set of technical tools, 
in the form of Lyapunov estimates, which are central to our results, and we will repeatedly refer 
to that paper for basic concepts and results. 
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As a first step in this paper, we prove a necessary and sufficient theoretical condition for 
detectability. As a second step, we proceed to explicitly construct two full-state observers that 
are guaranteed to converge globally, under the hypothesis that the system is detectable. 

In terms of performance, the second observer seems to have substantially slower convergence 
characteristics but, on the other hand, it is more "robust" to perturbations in measurements. 
This robustness can be quantified by an input-to-state stability (ISS) type of estimate, and we 
prove that the second observer is ISS with respect to a far wider class of external disturbances 
than the first (more efficient) observer. 

We also provide simulations which test the behavior of our observers in the presence of 
observation noise and even of unknown inputs acting on the system; the observers turn out to 
be surprisingly robust to such effects, but we leave for future work the formulation of theoretical 
results which quantify this robustness. The advantadges of our observers over the standard 
constructions of Luenberger and the extended Kalman filter are also illustrated by simulations. 
Most of the examples worked out in this paper, as well as the simulations testing the robustness 
of our observers, all concern the kinetic proofreading model proposed by McKeithan in [H] for 
T-cell receptor signal transduction (which motivated p0[ ). 

The organization of this paper is as follows: 

• This section introduces the problem as well as some notations and definitions to be 
adopted in this work. 

• Sections || and || introduce the observers, and the main results are stated and proved. 
Some examples that illustrate the result are presented, applying the construction to the 
case of the McKeithan network. 

• Section ||| shows how to adapt the proof of the main result to a more general model. 

• Section |5| provides simulations, testing in particular the effect of noise and of unknown 
inputs acting on states, or state drift. A comparison between the Luenberger observer, 
an extended Kalman filter and our main observer is also given here. 

• An appendix collects various technical results. 
1.1 The Problem 

The main objective of this paper is to construct (when and if possible) observers for a certain 
type of systems that provide a mathematical model for a class of chemical reactions. Several 
stability and other control-theoretic results applicable to such systems were discussed and sur- 
veyed (but without outputs) in the paper (see also [fl). The systems that we study all have 
the generic form 

x = f{x) (1) 

y = Hx), 

with the requirements on / and h specified next. 

The function / : IR n — * M. n is of the mass-action kinetics form 

m m 

^^''u-'V'-'V'....'V'(^ h,). (2) 

i=l j=l 

where m < n and each bj is a column vector in M n and has entries b\j , b%j , . . . , b n j , which are 
nonnegative integers. It is assumed that B := [61,621 • • • ,b n ] has rank m and that none of its 
rows vanishes. 
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The constants a™ are all nonnegative, and the matrix A 



is assumed to be irreducible. 



In the language of Feinberg et. al., irreducibility amounts to a restriction to "single linkage 



class" systems. This restriction can be removed, as explained in [20], provided that the space T> 
introduced below is defined in a slightly different way to account for the number of connected 
components in the incidence graph of A. In order to simplify the presentation, the main result 
is stated for irreducible systems, and a sketch of how to treat the "multiple linkage classes" 
case is given in Section |3]. 

Although (|2|) is defined on all of W 1 , we will be interested only on those trajectories which 
evolve in the positive orthant M™ = {(x%, . . . , x n ) £ W n : Xj > for all i}. It is easy to verify 
(cf. |2(| and below) that M> is a forward invariant set for f^j when / has the form (|J). 

Define 

V := span{6j — bj : i, j = 1, . . . , m} 

and let the canonical basis of M n be {ei : i = 1, . . . , n}. 

The function / does not depend explicitly on time t; however, throughout the text, several 
variations of the system will be considered, obtained by adding input terms to /. The inputs 
will be assumed to be measurable and bounded functions u : [0, +oo) — > MP, and the resulting 
right hand side will be denoted by f*(x,u). 

In order to decide what kind of output functions h : M n — > MP are natural to consider, 
we should think of the quantities that may be measured when performing a chemical experi- 
ment. Some possibilities are, for example, concentrations of some of the substances, or certain 
reaction rates (through markers, fluorescence, or energy released). This leads us to consider 
outputs whose coordinates are monomials. This kind of output includes both the case when 
the concentrations of some of the substances are measured (xi, X2, etc.) and the case when 
some of the reaction rates are measured (proportional to a monomial such as xix|). 



Thus, we consider in this paper output maps h 



MP (typically, p < n), of the form: 



h{x) 



,CllCi2 



C p l C p2 



(3) 



x{ x 



■ Xr, 



where 



C 



Cn Cu ■ ■■ Cln 
C%\ C 2 2 • • • C 2n 



Cpl C p 2 



—pn 



is a matrix all whose entries are either or real numbers > 1. (In view of the preceding 
discussion, the most natural choice would be to take the entries of C to be nonnegative integers, 
but we allow more arbitrary exponents since the results do not require integers. The restriction 
Cij > 1 is imposed in order to insure that h{x) is locally Lipschitz, which is needed in order to 
guarantee uniqueness of solutions in the observer equations to be presented later. Although we 
are ultimately interested in behavior for positive Xj's, the ouputs make sense on M n , provided 
that we interpret exponents x\ as |xj| c for negative Xj's.) 
Let us introduce the following vector functions: 



p n (x) = (lnxi, . . . ,lnx n )' 



defined on 



, and 



Exp r 



3 



defined on W 1 . (From now on, we will drop the subscript n of p n and Exp n , since its value is 
usually clear from the context.) Then, for x S K>oi 

p(h(x)) = Cp(x) 

and 

h(x) = Exp(Cp(x)), 

as long as all state variables (concentrations, when dealing with chemical models) Xi are positive. 
No Boundary Equilibra Assumption 

For the rest of this paper, we will make the following assumption: the system ^) has no 
boundary equilibrium in any positive stoichiometric class. That is, if x = (x±, . . . , x n ) is any 
vector with nonnegative components Xi, and some component Xi of x vanishes, and if x — x S 
span{6j — bj, i,j = 1, . . . ,m} for some x £ R>o> then f(x) 7^ 0. This assumption amounts 
to saying that no reaction consistent with positive concentrations can be in equilibrium if one 
of the participating substances is at zero concentration. It is an assumption that is often 
satisfied in chemical reaction models, and is in particular satisfied in the main example (kinetic 
proofreading) to be discussed. It is possible to weaken this boundary assumption and still 
obtain significant (though more restricted) results (using the techniques developed in |^|) but 
we prefer not to do so in order to streamline the presentation. 

Under the above assumption, the following result is a simple consequence of the theory 



developed by Feinberg et. al., see [20]. A (positive) class is any intersection of M™ with an 
affine manifold of the form £ + T>, where £ £ M™ and T> = span{6j — bj : i,j = 1, . . . , m}. We 
use the notation |x| for Euclidean norms. 

Theorem 1 For each positive class C there exists a (unique) state x = xq & M™ which 
is a globally asymptotically stable point relative to C, i.e., for each xq £ C, the solution of 
x = f(x), x(0) = xq is defined for all t > 0, and x(t) — > x as t — ► oo, and for all e > there 
exists 5 > such that, if \x — xq\ < S, then \x — x(t)\ < e for all t > 0. 

1.2 Detectability 

We will adopt the following 

Definition 1.1 The system (||) is detectable (on M™ ) ^ f° r everv t w ° trajectories x(-) and 
z(-) in R™ , defined for all t > 0, 

h(x{t)) = h(z(t)) => \x(t) - z(t)\ -> as t -> oo. 



Remark 1.2 Although we use it in this paper, this is not the most natural definition of de- 
tectability, because it is not "well posed" enough. In principle, one would want the definition 
of detectability to also include the property: "h(x(t)) h(z(t)) for all t implies \x(t) — z(t)\ 
is asymptotically near zero as t — > oo" , which can be formulated as an "incremental output to 
state stability" (or more generally, "incremental input/output to state stability", if there are 
inputs) property. Such a more general concept, in the style of |l3| and [23], can also be studied, 
and a future paper will deal with that matter. 
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Definition 1.3 By a (full-state) observer for (|l|) we mean a system z = g(z, h(x)) evolving on 
a state space X which is an open subset of W 1 containing K™ , such that, for each x(0), z(0) in 
K> , the composite system has solutions defined for all t > 0, and \z(t) — x(t)\ — > as t — > +oo. 

This is a weak definition, on that only "attraction" and not stability is required; however, 
in our proofs we achieve a stability property as well, as will follow from "KL" estimates. 
Detectability is, obviously, necessary for the existence of an observer. 

The next observation is valid, in general, for any system x = f(x), y = h{x) for which 
every trajectory converges to an equilibrium. For such a system, we denote by E the set of all 
equilibria, i.e., all x such that f(x) = 0. 

Lemma 1.4 Detectability is equivalent to: 

[h(x) = h(z) kx,z£E]^x = z. (4) 

Proof, [necessity] Suppose that the system is detectable, and pick x / z distinct elements of 
E so that h(x) = h{z). Then x(t) = x and z(t) = z are two trajectories with h(x(t)) = h(z(t)) 
and distinct limits, a contradiction. 

[sufficiency] Suppose that (Q) holds and pick any two trajectories with h(x(t)) = h(z(t)). 
Since h is continuous, this implies h{x) = h(z) for the limits of x and z, which exist and belong 
to E by the assumption. Then @ says x = z, as we wanted to prove. I 

This lemma applies, in particular, to our system (||) (with / as in ©), when we view (|l|) 
as restricted to the invariant set M" - ^ n or der to be consistent with the notations in [gfl], we 
denote the equilibrium set as emphasizing that we are only considering equilibria in the 

positive orthant. 

Since the function Exp is one to one, for positive vectors x, z £ R™ we have 

h(x) = h(z) o Cp(x) = Cp(z), 
so that the condition h{x) = h{z) becomes just 

p(x) — p(z) £ ker C . 
Recall also this fact from |2(|: if x € E + , then, for any z £ M" 0) 

p(x) - p(z) G V 1 - z G E + . (5) 



Theorem 2 The following statements are equivalent: 

(a) The system (|l]) with / as in (||) and h as in (||) is detectable; 

(b) \/x, z G M>o> ^ P( x ) ~ P( z ) e ker C and x, z G E + , then x = z; 

(c) V ± f)kerC = {0}; 

(d) V + im C = W l . 

Proof, [(a) 44> (b)] That condition (^) is equivalent to (b) follows immediately from the discussion 
above. 

[(b) (c)] Pick any y G f]ker C; we need to show that y = 0. Let x be any point 
of E + and put y = p(x) — y, y G W l . Then find z = Exp(y) G M™ so that y = p(z). Thus 
y = p(x) — p(z) with x G E + and z G M> - definition of y, p(x) — p{z) is contained both in 
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V 1 - and in kerC. Condition (|5|) now implies that z G E + . By assumption (b), we now conclude 
that x = z, or equivalently, y = as wanted. 

[(c) =>■ (b)] Let x,z G satisfy both — p(z) G kerC and i,z£ Then, from (j5|), 
if follows that p(x) — p(z) G T> . Therefore, p{x) — p(z) G T> QkerC. By assumption (c) 
— p(z) = 0, and therefore, since p(-) is a bijective function on R> > we nave x = 
[(c) 44> (d)] This equivalence follows by duality. I 

Remark 1.5 Note that dimD = m — 1 and that detectability implies m — 1 + rank C' = n. In 
the case the matrix C has full rank, then rank C = p and detectability implies m — 1 + p = n. 



2 Constructing Observers 

It has been shown in [^] that the system (|l]) can be steered to any of its equilibrium points 
in E + by applying additive error feedback. Namely, for any specific element x G E+, a set K 
of indices such that T> + spanje^ : k G K} = M. n and any positive constants 7fc,/c G K, the 
following system satisfies lim^oo \x(t) — x\ = for any initial conditions x(0) G M™ : 

x = f(x) + ^2 7fc(x fc - x k )e k . (6) 
keK 

So, a natural starting point in constructing an observer for x = f(x), would be to take a 
copy of it and add a linear feedback term similar to the above: 

z = f(z) + ^2 lk{xk ~ z k )e k . 
keK 

We know that eventually the trajectories of the system we want to observe will satisfy x(t) — > x, 
for some x G E + , so it seems reasonable to expect that the dynamics of the z-system will become 
very close to the dynamics of the modified x-system, (|6|), as time tends to infinity. 

To construct such an observer, we need the "right output", that is, a function h(x) that 
allows the construction of the suitable ^k{ x k ~ z k) e k terms. Thus, this approach will only work 
when the output map is linear and of a very special form. For instance, if there exists a matrix 
L such that 

Ly = LCx = ^2 x k e k 

keK 

where K is such that P+span{efc : k G K} = M. n (for detectability), then the observer suggested 
above can be constructed (using Ly instead of y as input to the observer). 

But in the general case such a matrix L does not exist and a different, but related, con- 
struction will be used. The main result is: 

Theorem 3 Consider the system (|l]) and assume that it is detectable. Then the following 
system, with state space X = R n , is an observer for the system (|l]): 

z = f(z) + C'(h(x)-h(z)). (7) 



The proof is given in Section 2.2. 
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Example 2.1 Consider the system with n = 4 and m = 3, determined by the vectors: 
h = (1,1,0,0)', b 2 = (0,0,1,0)', and 63 = (0,0,0, 1)'. 

Then 

P = span{(l, 1,-1,0), (1,1,0,-1)}, 
and for positive constants k,k 3 ,k^ and (5 3 , the system (|l|) becomes the "McKeithan network", 

X\ = — kx\X2 + ^3X3 + k^Xi 

±2 = -kx\X2 + k 3 x 3 + fc 4 x 4 (8) 
i 3 = kx\X2 - (k 3 + (3 3 )x 3 
%4 = @3%3 — k^x^. 

I. Suppose that the output is given by h(x) = (xi,Xi)'. We can take the following observer: 

Z\= -kziz 2 + k 3 z 3 + Z4Z4 +(x\ — z\) 
z 2 = -kz\Z2 + k 3 z 3 + Z4Z4 
z 3 = kz\z 2 - (k 3 + I3 3 )z 3 

ZA= P 3 Z 3 -k4,Z4 +(£4-2:4). 

II. Suppose that the output is given by h(x) )' . Then, we can construct the 
following observer: 

Z\ = -kz\Z2 + k 3 Z 3 + Z 4 Z 4 +(xiX2 - Z1Z2) + (X1X4 - Z\Za) 

z 2 = -kz\Z2 + k 3 z 3 + z 4 z 4 +2(xi^2 ~ z \ z 2) 

z 3 = kziz 2 - (k 3 + (3 3 )z 3 

Zi= f3 3 z 3 -k±z± +(x\x± — Z1Z4). 

The remainder of this section will be devoted to proving this theorem. The basic idea is to 
study the stability properties of system ([]]) when a certain input is added to the function /, 
specifically, the system with right-hand side: 

r(z, U ):=f(z) + C'(u-h(z)). (9) 

We will show that, for the system thus obtained, an "input to state stability" condition holds. 
The observer for ([j]) is obtained by letting the input be u(t) = h(x(t)). 

The analysis of this system with inputs is interesting in its own right, since it provides a 
means of studying the behavior of the model under bounded inputs. 



2.1 An ISS Property 

The definition of an input-to-state stable (ISS) system was introduced in |]l^]. Here, we adapt 
this notion to deal with constrained inputs and relative equilibria, as well as positive states 
(i.e., those states with all coordinates in the strictly positive half-line). 

From now on, whenever we mention an input u(-), we will mean a measurable essentially 
bounded function u : [0, +00) — > MP, possibly restricted to take values in a set U of MP. For 
u : [0, +00) — > MP and any fixed u G M n , denote 

||n — n|| := ess. sup -{\u(t) — u\ : t > 0}. 
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Definition 2.2 A system z = f*(z,u), with input-value set U, evolving on a state space X 
which is an open subset of M. n containing K" > ^ s ^>o~ (forward) invariant if, for each initial state 
z(0) G M" an< ^ eacn U- valued input u(-), the corresponding maximal solution of z = f*(z,u) 
as a differential equation in X, which is defined on an interval J z (o\ u = [0,i max ), has values 
z(t) G R™ for all i G J z(0)i „. 

The system is (forward) complete if it is ^"-(forward) invariant and, for each z(0) G R™ 
and U-valued input «(•), J z (o), M = [0, +00). 

For the following definitions, fix points x G and ft G U, where U is a subset of R p . 

Definition 2.3 A system £ = f*(z, u), evolving on a state space X which is an open subset of 
W 1 containing M> > ^ s input-to-state stable with input-value set U (with respect to the point x 
and the input u) if it is R™ -complete and there exist a function (3 of class ICC and a function 
ip of class /Cqo such that, for each U-valued input u(-), and each initial condition zq G R> ' f° r 
all t it holds that 

\z(t)-x\ <P(\zo-x\,t) + <p(\\u-u\\). (10) 

To study the stability properties of the system z = f*(z,u) a Lyapunov-type technique is 
used, and the following definition is needed. 

Definition 2.4 An ISS-Lyapunov function with respect to the point x and input u, for the 
system i = f*(z,u) with inputs in U C MP, evolving on a state space X which is an open 
subset of M. n containing M> > ^ s a continuous function V : M> — > R, whose restriction to IR™ 
is continuously differentiable, which satisfies: 

(i) For z G R5 , V{z) > and V(z) = 0»z = i. 

(ii) The set {z G R> : V^(z) < L} is compact, for each positive constant L. 

(iii) There exist two functions a, 7 G /Coo such that 

W(z) f*{z,u) < -a{\z-x\)+j{\u-u\) 
for all u G U and z G M> . 



Remark 2.5 This definition differs slightly from other definitions of "ISS-Lyapunov" functions, 
such as given in [pif| . The difference is in the fact that here the function V is only required to 
be differentiable in the set M> > an< ^ ^ ^ s no ^ required to satisfy a decrease condition except 
at positive vectors. Observe that V is not proper when restricted to the positive orthant: it 
remains finite as the boundary of R™ is approached. 



Remark 2.6 For a function V defined as above, there always exist /Coo functions, Ui,V2, such 
that 

vi(\z-x\)<V(z)<v 2 (\z-x\) (11) 



for all z G M> (see Appendix AA for a construction of these bounds). 
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Next, we introduce our candidate ISS-Lyapunov function. Fix an x S E + , and define the 
following function: V : M> — > R: 



n / * n 

V(z) =Y,*i 9 (f;) =X] X ' 



In * + 1 - ^ 



(12) 



where 5 : R>o — > R, <?(?") = r lnr + 1 — r, with the convention that g(0) = 1. The function V is 
continuously differentiable on M™ and has the following properties: 

(i) It is positive definite on M™ with respect to x, i.e., V(z) > and V(z) = 44> z = x: 

g (r) = lnr 

so each ^(r) strictly increases on (l,+oo) and strictly decreases on (0,1). But since g(l) = 0, 
it must be that g{r) > for all r £ R>o- As a sum of nonnegative quantities, V(z) is also 
nonnegative and, moreover, 

V(z) = ^ g f^j = OVi ^ Zi = XiVi, 

that is, V(z) = 44> z = x. 

(ii) For each constant L > the set {z £ M> : ^(z) < L} is a compact subset of R> : 
y(z) < L implies g(zi/x~i) < L for all i; continuity of g on [0, +00) (since lim w ^o g(w) = 1) 
implies that z^ stays in a compact interval of M>o and hence 2; stays in a compact subset of 
R| . 

Finally, in Appendix [A.l| examples of possible functions ^1, 1^2 are given. 

In the next lemma, we show that the existence of an ISS-Lyapunov function (according to 
Definition |2.4| ) for a given system, implies that the trajectories of that system satisfy an ISS 
estimate. 

Lemma 2.7 Consider an R™ -invariant system z = f*(z,u), with input-value set U. Suppose 
that either: 

• the system is R" -complete, or 

• the state-space X contains R> - 

Fix a state x S R™ and input value u £ U. Suppose that that there is some ISS-Lyapunov 
funtion V with respect to x and u. 

Then, the system is input-to-state stable with input-value set U (with respect to the same 
x and u). 

Proof. This proof is very similar to what is done in the case of the usual definition of an ISS 



system (see [21|, for instance). 

According to the definition, V satisfies an estimate of the form 

VV(z) f*(z,u) < -a(\z-x\)+^(\u-u\), 

for each z £ R> arm each u in the input value set U, where a, 7 G /Coo. On the other 



hand, we know from Remark |2.q that there exist two class /Coo functions, u±,U2, such that 
v\(\z — x\) < V(z) < v-z(\z — x\). Introducing a new /Coo function x given by x(^) = a _1 (27(r')), 
observe that: 

\z-x\> x(\u - u\) ^ a(\z - x\) > 2j(\u - u\) => VV(z) f*(z,u) < -^a{\z - x\). (13) 
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Now pick any initial condition zq G K>o, and an input u : [0, +00) — > U, and consider the 

corresponding maximal solution z(t) denned on J = [0,i max ). For s = v 2 (x(\\ u — u\\)), define 
the following sublevel set of V: 

S = {t€R$v:V(t)<8}. 

Claim. Suppose there exists an instant a G J such that z(<r) G S. Then z(t) G S* for all 
a <t < i max . 

To see this, argue by contradiction: suppose there exists at>a (but t < t max ) and an e > 
such that V(z(t)) > s + e. Let 

r = inf{t > o : V(z(t)) > s + e}. 

Then z(r) ^ 5 which implies V(,z(t)) > ^(xdl - " ~~ ^ll))> an d therefore, since V(z) < u 2 (\z — %\) 
and v 2 is strictly increasing, 

v 2 (\z(t) - x\) > v 2 {x{\\u - u\\)) <{=>• \z(t) -x\> x{\\u - u\\). 

By (|l|), ^V(z(t))\ T < 0, implying that V(z(Q) > V(z(r)) for some t* G (cr,r), and thus 
contradicting minimality of r. So the claim holds. 

Now, let 

T = inf{<r : z(ct) G 5} 

(with T = +00 if the trajectory never enters S). We have two cases to consider, for each t > 0: 
For t > T: V(s(i)) < Mx(h ~ "ID) implies 

i*(|z(t) - x|) < K2(x(ll« - «ID) => - *l < ^r 1 ^2 o X (\W ~ "ID, 

so we let 93 = j^f 1 o z>2 X- 

For t < T: V(z(t)) > ^2 (x (11^ — ^ll))> which implies \z(t) — x\ > x(\\ u ~ and hence 

^V(z(t)) < - l - a (\z(t) - x\) < -l a [u 2 (V(z(t)))]. 

By a standard comparison principle, there exists a function j3 G ICC (which depends only on a 
and u 2 ) such that V(z{t)) < (3(V(z ),t) for all t < T. Then 

\z(t)-x\ <v^(p(V(z ),t)) <u^0(u 2 (\z o -x\),t)) := P(\z - x\,t). 

Thus, for all t G J, 

\z(t) - x\ < max{/?(|z - x\,t),cp(\\u - u\\)} < (3(\z - x\,t) + tp(\\u - u\\) 

where j3 G ICC and ip G /Coo- 

Under the assumption that the system is M™ -complete, we know that J = [0, 00), and thus 
this inequality holds for all t > 0. 

If, instead, we assume that X contains M> , we argue as follows. The above ISS inequality 
implies that {z(t),t G J} is bounded (by 0(\zo — x\,0) + (p(\\u — u\\)) and is a subset of K> . 
Thus, the maximal solution remains in a compact set of X for all t in the maximal interval J, 
which implies that J has to be all of [0, +00). I 
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2.2 Proof of Theorem g 

Lemma 2.8 Let f*(z,u) = f(z) + C'(u-h(z)) with C such that V + imC = R n . Let x denote 
any point in E+, and 9 be any real number with < 9 < 1. Define the following subset of MP: 

U e = {ueR p : \u k - h k (x)\ < -h k (x), k=l,...,p}. 

Then, for the function V defined in (|T2|), there exist functions a = ctj, 7 = 7^ of class /Coo such 
that 

VV(z) f*(z, u) < -a(\z -x\) + 7 (|n - h(x)\), 
for all z e M> and all u £U e . 

Proof. Pick any x £ E + and any < 9 < 1. We have 

VV(z) f*(z,u) = (p(z) - p{x)J{z)) + (p(z) - p(x),C'(u - h(z))). 

Using the notation g = p(z) — p(x), notice that the second term on the right can be rewritten 
as: 

{Cg, u — h(z)) = (Cg, u — h(x)) — (Cg, h(z) — h(x)). 

Introducing the notation a = Cg, p = h(z) — h(x) and v = u — h(x), the expression for 
W(z) f*(z,u) becomes 

VV(z)f*(z,u) = (gj( z ))-(a,p) + (a,v) 
= P(z,x) + R(z,u,x) 

where 

P(z,x) = (gJ(z))-(l-9)(a,p) 

and 

R(z, u, x) = -6(a, p) + {a, v) = ^ ai{-6pi + u f ) 

We now bound each of these terms. 

Step 1. We show first that R(z,u,x) < C3 1 v | 2 , for some positive constant C3. 
Notice that piOi = (hi(z) — /ij(x))(ln hi(z) — lnhi(x~)) > for all pairs hi(z),hi(x). 

(i) if 9\pi\ > \vi\, then immediately 

Oi(-9pi + Vi) < 0. 

(ii) if 9\pi\ < \vi\, then 

2 c L 

(Ti(-9pi + Vi) < 2\(Ti\\Vi\ < -0—\ v i\ 2 

where the last inequality follows from the bounds on it: 

9 11 

\vi\ = \ui - hi(x)\ < -hi(x) \hi(z) - hi(x)\ = pi < -\vi\ < -hi(x), 

so that hi(z) > hi{x)/2, and with a Lipschitz constant, cl, of the logarithmic function on 
[A, +00), when A = mm{hi(x)/2 : i = 1, . . . ,p} (for instance, cl = l/A): 

\<Ti\ = I lnhi(z) - lnhi(x)\ < c L \hi(z) - hi(x)\ = c L \pi\. 
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In either case, 

R(z,u,x) < ^p^N 2 

as wanted. 

Step 2. Show that P(z,x) < —a(\z — x\), where a £ K. c 
Recall a result from p0|, namely: 



(p(z)-p(x),f(z)) < - c (x)^2^2(b l -b J ,p(z)-p(x)) 2 

i=l j=l 

< -c(x)\D (p(z)-p(x))\ 2 (14) 

where -Do is an (m — 1) x n matrix whose rows (transposed) constitute a basis of T>, and c(-) is 
a positive function. Using this estimate yields: 

P(z,x) < - Cl \D g\ 2 - (l-0)(a,fj) 

where c\ = c(x). By assumption, T> + imC = M n , so we know that 

span {cols(D )} + span {cols(C')} = M n , 

so the matrix D = [D' Q C']' has rank n. 

Next, note that combining Lemmas A. 2 and [A.3| , it is enough to find two /Coo functions 
«i, «2 such that, 

P(z,x) < -a 1 (\D g\)-a 2 (\Cg\). 

We may simply take ai(w) = c\w 2 and will next produce a 2 ■ 

Define, for each fixed z, the following finite, disjoint sets of integers 

/+ = I+(z) = {i : hi(z) - hi(x) > 0} 

and 

I_=I_(z) = {i:hi(z)-hi(x)<0}. 

Write for each i, 

hj(z) - hj(x) 
hi{x) 

It is not difficult to check that 



In hi(z) - \nhi(x) + g(hi(z),hi(x)). 



has a minimum value of 0, achieved whenever hi{z) = hi(x), and otherwise is strictly positive. 
Let Ci denote the i-th row of the matrix C and let 

7-7^- = ii + Ci and Ui = CiQ = ^ + r/j 
hi{x) 

with 

Ui = lnhi(z) — lnhi(x) = CiQ, if i G 1+ 
Hi/hi(x) = (hi{z) - hi(x))/hi(x), if i G /_ 



12 



g(hi(z),hi(x)), ifi€i+ 



0, 



if i € /- 



0, if i € 1+ 

-g(hi(z),hi(x)), iiiel-. 



By construction, (£j + Ci)(C» + ^i) — £f i since all cross terms are nonnegative: 



g(hi(z),hi(x))(lnhi(z) - lnhi(x)) > 0, if i € I-} 
0, if i € I- 



0, 



if i e J4 



-g(hi(z), hi{x)){hi(z) - hi(x))/hi(x) > 0, if i € Z_ 
and CiVi = f° r an *■ Therefore, for each 

= hi(x)(C ie ) 2 , 



and for each i 6 I_ 



hi{x)CiQ 

hi(x)\CiQ\ 

hi{x)\CiQ\ 

hi{x)\CiQ\ 

hi{x)\CiQ\ 
hi{x)\CiQ\ 



hj(z) - hj(x) 

hi(x) 
hi(z) - hi(x) 



hi(x) 



hi(z) 



hi(x) 



1 



In I 



hAzy 



e V h i( x ) J — 1 

e [ln/ij(«)-lnftj(s)] _ y 
e -\C ia \ _ ! 



since Cj£ = ln/ij(z) — ln/ij(x) < for % E i_. 

In summary, with C2 = min{/ii(x), . . . , /i p (x)}, 



-(1 - 9)(a, M ) < - C2 ( 2 |C^| 2 + £ \C W \ (l - e-l c ^l) J , 
\iei+ ieJ- / 

and since both functions o(r) = c 2 r 2 and 6(r) = c 2 r(l — e _r ) are of class /C^, one may apply 
Lemma [A. 3] repeatedly to obtain a 2 G /Coo such that — (1 — 0)(a, /z) < — a 2 (|C£>|). 

So, taking a < ai + a 2 according to Lemma A. 3, and 7(7*) = C3 r 2 , gives the desired result. 
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Proposition 2.9 Suppose that the system defined by (||) and (||) is detectable. Consider the 
system with inputs 

z = r(z,u):=f(z)+C'(u-h(z)) (15) 

with state-space X = W 1 . Then, the system is M™ -invariant with input- value set M> . Fur- 
thermore, let 9 be any real number with < 6 < 1, and pick any fixed state x £ E+. Let 



be the subset of W defined in Lemma |2.8| . Then, the system (|15|) is ISS with input-value set 
Vq (with respect to the point x and the input u = h(x)). 

Proof. The proof of the first statement, namely that the system is M™ -invariant with input- 
value set K> , is fairly routine, and it proceeds as follows. 

Given an initial condition z(0) £ ^>o> an d an Unvalued input, let z(t) be the maximal 
solution of (15), defined on a (maximal) interval J. Let X = [0, +oo). 

Assume that one of the coordinates becomes < at some instant and define 

to = inf{t £ J : z k (t) = for some 1 < k < n}. 

Pick one coordinate k such that z k (to) = 0. We reorder variables, singling out this coordinate, 
and look at the time-dependent differential equation that results by fixing the remaining n — 1 
variables. It is useful for that purpose to introduce the following notation: 

(z(t),x) = (zi(t), z k -i{t),x, z k+1 (t), z n (t)). 

In addition, we wish to see the obtained scalar equation as well-defined for all t, not just t < to- 
So we construct a new function F : I x R -> 1 as follows: 



F(t,x) 



ft(2(t),x;u(t)), t€[0,to) 
f^{z(t ),x;u ), te[t ,+oo) 



where uq is any fixed element of Vg. Then, for each fixed t, F(t, x) is locally Lipschitz in x and 
the Lipschitz constants, a(t), are uniformly bounded (and hence locally integrable as a function 
of time). In addition, for each fixed x, F(t,x) is measurable as a function of time. Thus the 
standard existence and uniqueness conditions apply. 

Claim. F(t, 0) > for almost all t £ J. 



To prove this, write 



j k {z,x,u) — ) j ) j a iJ z 1 ■■■z k _ 1 z k+1 ...z n o 
i=l jeA 



+ EE ^ ■ ■ ■ i-f^^i:?^ ■ ■ ■ ^'(^ - M (is) 

i=l j&A+ 

p 

+ ^Cjkiuj - hj(z,x)], 
3=1 

where A = {j : b k j = 0} and A + = {j : b kj > 0}. 
For x = and t £ I: 

(a) the third term is nonnegative since we are assuming that Cji > and Uj > for all 
and because hj(z, x) = z\ jl . . . x c i k . . . z n 3 ™, so either Cj k = 0, or Cj k > and hj(z, 0) = 0. 

(b) the second term is zero since x = 0; 
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(c) the first term is nonnegative since, by definition of to, we are evaluating at z\ = Zi{t) > 0, 
for all i and t < to, and Zi = Zi(to) for t > to- 

This proves the claim. 

Moreover, notice that, for all t < to, the scalar variable Zk{t) satisfies the initial value 
problem 

x = F(t,x) 
x(0) = z fc (0), 

where F(t, 0) > for all t > 0. Solutions of this initial value problem exist on an open interval 
J, and this interval contains [0, to] because Zk(t) solves the equation in that interval. Then, by 



Lemma AA, x(t) > on J and Z}~(t) = x(t) > for all t < t ; since both x(t) and Z)~(t) are 
continuous functions, we also have that £ft(io) = x(to), contradicting the fact that z^ifo) = 0. 

This concludes the proof of M" -invariance with input- value set K> . As this implies 
that ( |i~5| ) is also IR" -invariant with (the smaller) input-value set Ug, to prove that ([l5|) is 
ISS with input- value set U#, it is enough, by Lemma |2.7| , to show that this system admits the 
function V defined in ( |i~2| ) as an ISS-Lyapunov function with respect to the state x and the 
input h{x). 



So we must check that V satisfies Definition 2A. Properties (i) and (ii) have already been 



shown in the discussion following formula (12). Property (iii) follows from Lemma 



Lemma 2.10 Let f*(z,u) = f (z) + C (u — h(z)) and let V denote the function defined in ( |i~2"l) 
For each fixed x £ E + , and each constant ii max > 0, there exists a constant Cs, Umax such that 



^ max I 



VV(z) f(z,u) <c SjUmm , VzeR^o, V«e[0, M „ 

Proof. Pick any x S E + and any nonnegative n max . Then, using estimate (JH|), 

VV(z)f*(z,u) = (p(z)-p(x)J(z)) + (p(z)-p(x),C'(u-h(z))) 

< -ci\D (p(z) - p(x))\ 2 + (p(z) ~ P(x) } C'(u - h(z))) 

< (C(p(z)-p(x)),u-h(z)) 

V 

:= ^2si(z,u,x) 
i=l 

where Si(z,u,x) = (lnhi(z) — \nhi(x))(ui — h,i(z)). 

Now, let /i max be a constant depending only on x such that, maxj = i v .. jP | h\hi{x)\ < h n 
For each fixed z, define the following finite, disjoint sets of integers 

/+ = I+(z) = {i : hi(z) > 1} 
/_=/_(*) = {i : hi(z) < 1}. 

Clearly J + U i_ = {1, . . . ,p}, and for each i G J_, 



Uilnhi(z) < 

\uihxhi(x)\ < u m&x \lnhi(x)\ < u max h n 

\hi(z)\uhi(z)\ < - 

\hi(z) In hi(x)\ < \ ln.hi(x)\ < h max , 



15 



so that, for the corresponding iih term in the above sum: 

Si(z, u, x) = Ui In hi(z) — Ui In hi(x) — hi(z) In hi(z) + hi(z) In hi(x) 

^ -\- w max /i max -|- -\- h maK . 

e 

On the other hand, for each i G I+, Si can be decomposed into two terms 

-(lnhi(z) - In hi(x))(hi(z) - hi(x)) + (ln/ij(z) - \nhi(xj){ui - hi{x)) 

the first of which is always negative. Since hi{z) > 1 there is a Lipschitz constant cl = cl{x) 
such that | ln/ij(z) — \nh; L {x)\ < ci\h; L {z) — hi{x)\ for all z such that hi(z) > 1. So we have, 

JO, if \ui - hi(x)\ < \h,i(z) - hi(x)\ 

Si {z,u,x) _ | _ ^ ( _ )|2; . f ^ _ ^ > ^ _ 

In either case, we may just write 

Si(z,u,x) < 2c L (nL x + /iLx) 
whenever i £ I + . Then, with Cx, Umax = pmax{u max /i max + 1/e + fr max , 2ci(u^ aj£ + fr^)}, 

W(z) /*(*,«) <c*, Umax , V Z E1R™ , VnelO,^ 
as we wanted to show. 



"maxj j 



We also have the following M™ -completeness result (notice that Corollary p. 11 would follow 
immediately from Proposition Ell in the case of an input u : [0, +oo) - 



Corollary 2.11 Under the assumptions of Proposition 2J), system ( |l5|) is M"g-complete with 

v 

>o- 



input- value set m '" 



Proof. Suppose that u : [0, +oo) — > [0, u max ] p , and pick any initial condition z(0) € ^>o- We 
already know, from Proposition \2.9[ that system (jl^) is M" -invariant with input- value set M> . 
Suppose that the maximal interval of existence would be [0, i max ) with i max < +oo. 



From Lemma 2.10 we have that 
d 



d V(z(t)) = VV(z) f*(z,u) < c,, Umax , Vt. 



So V(z{t)) < V(z(0)) + c SjUmax t max . Since V is proper (property (ii) of Definition 2.4), we 
conclude that z(t) belongs to a compact subset of the state space M n , a contradiction with 

tmax < OO. I 

Proof of Theorem Pick any initial states x(0) £ R™ and z(0) G K™ of the original 
system ([j]) and the observer, respectively. We let w(-) = (x(-),z(-)) be the maximal trajectory 
of the composite system 

x = f{x) 

z = f(z) + C'(h(x)-h(z)), 

which we also write as w = g(w), with initial condition (x(0), z(0)). We need to show that 
w(t) = (x(t), z(t)) is defined for all t > 0, and \z{t) — x(t)\ — > as t — > +oo. 

Since we know that x(t) is defined for all t > and converges to some equilibrium x as 
t — > +oo, we must prove that z(t) is also defined for all t > and converges to this same x as 
t — > +oo. 
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Fix to be any fixed constant such that < 6 < 1 and (since x(t) converges) let T be such 
that 

t>T => \hi(x(t)) - hi(x)\ < -h(x) 

for all i = 1, . . . ,p. Let Vg be the set of vectors u such that \u{ — hi(x)\ < 0hi(x)/2. Corol- 
lary p. 11 shows that the solution z(t) exists for all t > 0. Now, Proposition |2.9| , applied with 
«(i) = h{x{t + T)) and u = h{x) (note that h(x(t)) G U# for all £ > T), shows that, for all 
t > T, 

\z(t) -x\< p(\z(T) -x\,t) + <p(\\h(x) - h(x)\\). 

where (3(r, t) — ► for each fixed r and i^(r) — > as r — > 0. 
Now, given any e > 0, let T\ be such that 

^\\h(x)-h{x)\\i)<l 
where \\h{x) — h{x)\\\ = ess. sup ,{\h{x{t)) — h{x)\ : t > T{\ (such T\ exists because 

\h(x(t)) - h(x)\ 

as t — » +oo). 

Next, choose T2 > T\ such that 

/?(|z(Ti)-a;|,t) < |, Vt>T 2 . 

Then, rechoosing T (if necessary) to be larger than T 2 we have that, for all t >T, \z(t) — x\ < e. 
Therefore, z(t) — > x as t — > +00 as wanted. I 

Remark 2.12 The observer (0) can be slightly modified to the form 

z = f{z) + C'W(h(x)-h(z)) (17) 

where W is any positive definite, diagonal p x p matrix. This allows more flexibility when 
choosing the gain matrix, namely, it is possible to assign different weights to each component 
of h(x). The conclusions of Proposition still hold for (|i~7|). Since W = diag(ioi, W2, ■ ■ ■ , w p ) 
with Wi > and W = S'S for S = diag(- v /u7i, y/wi, . . . , ^/w^), M> -invariance is guaranteed by 
the same argument given in that Proposition. Expression (|l6D becomes 



fk(z, x; u) — E E a «i z i 1J • • • z k-i h ' z k+i h ■ ■ ■ Zn^bki 

+ EE ^ • • • i-^^il^ ■ ■ ■ ^{bu - b k3 ) 

i=l jeA+ 
p 

+ ^Cj ky /w^[uj-hj(z,x)\, 
3=1 

which is still nonegative when x = 0. 

The function V in ( [i2| ) is also an ISS-Lyapunov function for (|17]). To check property (iii) 
we compute (with g = p(z) — p(x) and < 6 < 1): 

VV(z)f*(z,u) = {g,f(z)) + {g,C'S'S( U -h(z))) 
= (g,f(z)) + (SCg,S(u-h(z))) 
= P(z, x) + R(z,u, x) 
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where 



P(z, x) = (g, f(z)) - (1 - 9){SCq, S(h(x) - h(z))) 
R(z, u, x) = (SCg, S(u - h{x)) - 8(SCg, S(h(x) - h(x))). 

Using the notation a = SCg, \x = S{h(z) — h(x)) and v = S(u — h(x)), steps 1 and 2 of the 



proof of Lemma 2.8 still hold, and we obtain: 



R(z, u, x) < c\v\ 2 
P(z,x) < -ai(\D g\)-a 4 (\Cg\). 



The function a\ G /Coo is as in Lemma 2J3. The function 04 G /Coo and the constant c are 
similar to Q2 and C3, respectively, of that Lemma: c = 2cl/6, where cl is a Lipschitz constant 
of the logarithmic function on [A, +00), when A = mm{y/wihi(x) /2 : i = 1, . . . ,p}; the function 



04 is obtained by repeatedly applying Lemma A. 3 to the expression on the left: 



C2 



^2 \C t g\ 2 + 52 \CiQ\ (l " e- |C ^) ] > a A (\Cg\), 
iel- 



since both functions a(r) = C2 r 2 and b(r) = cir(\ — e r ) are of class K.^. The multiplicative 
constant is given by C2 = m.in{wihi(x), . . . ,w p h p (x)}. 

With an ISS estimate for the difference \z(t) — x\ and the completeness result, the same 
proof above shows that ([17]) is an observer for system (|l|). 



2.3 A Remark on Observation Noise 

The ISS estimate obtained for the observer allows us to conclude that the observer is robust 
with respect to small observation noise. We sketch this next. If the input to the observer 
is u(t) = h(x(t)) + n(t) instead of h(x(t)), then the same conclusions hold regarding global 
existence of trajectories (at least, provided that h(x(t)) +n(t) is nonnegative) . In addition, for 
large t (so h(x(t)) h(x) = u), the term <p(\\u — u\\) in the ISS estimate becomes approximately 
<£>(||n||); thus one obtains an asymptotic estimate on z(t) which is a IC function of the noise level, 
and is in particular small when n is small in magnitude. 



2.4 A Separation Property 



A direct proof of Theorem || can be given using a Lyapunov function for the composite system 
(plant and observer), which is entirely analogous to the "separation theorem" proof used to 
establish the convergence of Luenberger observers. Let us sketch the procedure here. 

Consider the error e = x — z and look at the composite system in new coordinates x and e 
instead of x and z: 



x 

€ - 



f(x) 



fix) - f(x - e) - C'(h(x) - h(x - e)) . 
Now consider the following function: 



V 2 (x,e) = V(x-e) 



E 

8=1 



Xi 



— In — 



+ 1 
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Using arguments similar to those employed earlier, we know that 



£v 2 (x(t), e(t)) < -a(\x -e-x\)+ c 3 \h(x) - h(x)\ 2 
at 

for all x = x(t) such that \hk(x) — hk(x)\ < hk j^' for all k = 1, . . . ,p. (here we are taking the 
number 6 from Proposition |2.9| to be equal to 1/2.) Since x(t) — ► x as t — ► +oo (Theorem [l]), 
there exists some T such that this inequality holds for every t > T. 

Since e = x — z and we have shown that both solutions x(t) and z(t) exist for all t > 0, we 
may conclude that a solution e(t) to the e-equation also exists for all t > (for a direct proof 
of this fact, one may use a similar argument to that of Proposition ^9| ). 

Recall that we already have a Lyapunov function, V\{x) = Yl?=i %i 9( x i/ x i)> f° r the system 
x = f(x), which satisfies an estimate of the form (see p0| ): 



^(z(t)) < -c{x)8{x,x) 

where 

5(x, x) = c\\x — x\ 2 + d(\x — x\ 2 ) 
and d(r) = o(r) as r — > 0. Pick e < |x|/2 such that 

\d(\x — x\ 2 )\ < -^\x — x\ 2 V \x — x\ < e . 

Now choose the time T = T(e) so large that 

h(x) 

\x(t)-x\<£ and \h(x(t)) - h(x)\ < for alH > T. 



So, for t > T, we also have that x(t) evolves in a compact set B{x) = {a 6 M™ : |o — x| < |x|/2}, 
and hence there exists a Lipschitz constant, c^, (take, for example, = max |§§ t (^)| with the 
maximum taken over j = 1, . . . , n and k = 1, . . . ,p and x £ B(x)) such that 

\h{x) — h(x)\ < ci\x — x\, Vx£ B(x). 

Thus for t > T, the following estimates hold: 



c\c(x) 



X — X\ 



dt 

f t V 2 (x(t), e(t)) < -a(\x -e-x\) + c 3 c 2 L \x - x\ 2 . 

This means that 



W(x,e) = 2ic3C \ + 2) Vl (x) + V 2 (x,e) 
cicyx) 



is a Lyapunov function for the system S e , with 

—W(x(t), e(t)) < -a(|x - e - x|) - 2|x - x| 2 , 

for all t > T. 

It is not difficult to verify that this estimate can be rewritten as 

—W(x(t), e(t)) < -a(\e\ 2 + \x- x\ 2 ) 
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for some other a G K,^. (First note that, with «2(c) = c 2 and using Lemma A3, there exists 
0(3 G /Cqo such that 



a(\x — e — x\) + a 2 {\x — x\) > a^(\/\x — e — x\ 2 + \x — x\ 2 ) 



Z 1 , _, 1, 

> «3 ( -\x - e - x\ + -\x - x\ 

> «3 I „|e| J ■ 



And then, using Lemma [A.3| again, as(|e|/2) + a 2 {\x — x\) > a3(i/|e| 2 + \x 



x 
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Next, notice that W(x,e) < -^^p-^dx — x\) + u 2 (\x — e — x\), where v 2 G K-oo is as 



in (PI), so immediately 

W(x, e) < ^ 2 (2 v / |e| 2 + |3;-x| 2 ) := P 2 (|e| 2 + |» - ^| 2 ), 
where k = 1 + 2 ( C3C f^ . Then 

|^(x(t),e(t))<-d(z> 2 - 1 (W(x,6))) 
which implies that W(a;(t), e(t)) — > as t — > +00 and so (x(t),e(t)) — ► (x,0), as wanted. 



3 An Alternative Observer 

In this section we construct an alternative observer, which works under the same necessary and 
sufficient conditions for detectability. As mentioned in the Introduction, this observer exhibits 
slower convergence but this is traded for more robustness to additive disturbances. Among 
other interesting features, the observer state never becomes negative, even under arbitrary 
disturbances, which is a natural requirement on physical grounds (since the state being observed 
is always nonnegative) . This nonnegativity is enforced by a "barrier feedback" reminiscent of 
optimization techniques. 

We now use logarithms of the outputs as input to the observer. 

Theorem 4 Consider system (|l]) and assume that it is detectable. Then the following system, 
with state-space X = M> ' i s an observer for system (|]): 

z = f{z) + C'{p{h{x))-p{h{z))). (18) 

We restricted this to a system with state-space so that the logarithms are well-defined. 
Using the convenient notation 

H(x) := [H x {x) H 2 {x) . . . H p (x)}' = Cp{x) (19) 

where Hi(x) = en In x\ + ■ ■ ■ + Ci n lnx n , the observer can be written 

z = f(z) + C'(H(x)-H(z)). 
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Example 3.1 Consider again the system in Example (2.1). Suppose that the output is given 
by h(x) = (xix 2 , X1X4)'. Then H{x) = (lnxi + 21nx2,lnxi + lnx4) and we can construct the 
following observer: 



Z\ = —kz\z<i + + +(lna?i + 21nx2 — ln^i — 2 In 22) 

+ (ln x\ + In X4 — In z\ — In £4) 
Z2 = —kz\z<i + + Z4Z4 +2(ln xi + 2 In X2 — In zi — 2 In Z2) 
z 3 = kz\Zi - (k 3 + f3 3 )z3 

Z4 = P3Z3 — k^z^ +(lnxi + lnx4 — lnzi — I11Z4). 

The proof of this theorem involves a technique similar to the one used in the proof of 
Theorem ||, by studying the stability properties of the more general system 

z = f*(z,u) :=f(z) + C'(u-H(z)), 

and showing that it satisfies an "input to state stability" condition. The observer for ([]]) is 
obtained by letting the input be u(t) = H(x(t)). 

Lemma 3.2 Let f*(z, u) = f(z) + C\u - H(x)) and assume C is such that V + im C = W 1 . 
Let x denote any point in E + and V be the function defined in (|l2|). Then there exist class 
functions a = a s and 7 = 7^ such that 

VV(z) f*(z, u) < -a(\z -x\)+ 7 (|u - H(x)\), 

for all z G M" and all ueR p . Moreover, one may pick 7(7-) = r 2 /2. 

Proof. Pick any x G E + . We have (using the notation g = p(z) — p(x)): 

VV(z)f*(z,u) = (p(z)-p(x),f(z)) + (p(z)-p(x),C'(u-H(z))) 
= (g,f(z)) + (C e ,u-H(z)) 
= (g,f(z)) + (Cg,u-H(x))-(Cg,Cg) 

where the definition H(z) = Cp(z) was used, as well as the equality u — H(z) = u — H(x) + 
H{x)-H(z). 

Now, using result ( |l~4| ) and then Cauchy-Schwartz and the inequality ab < \(a? + b 2 ): 

VV{z) f*(z,u) < -c(x)\D g\ 2 - \C g\ 2 + (Cg,u- H{x)) 

< -c(x)\D g\ 2 -±\Cg\ 2 + ±\u-H(x)\ 2 

for all z G and all u G M p , where -Do is an ( m — 1) x n matrix whose rows (transposed) 
constitute a basis of D, and c(-) is a positive function. By assumption, we know that the matrix 
D = [D' C'Y has rank n. So, for all z G M™ and all u G W, 

VV(z) f*(z,u) < -mmL(x),^\\Dg\ 2 + ±\u-H(x)\ 2 

< -a{\z-x\)+j(\u-H{x)\), 

where a(r) = min{c(x), l/2}a(r) and a is as in Lemma [A.2j (with S = {x}), and j(r) = | r 2 . I 
We will consider a system with inputs 

z = r(z,u):=f(z) + C'( U -H(z)). (20) 
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Since 



n p 



C'{u - H(z)) = Cifc(uj - Hi(z))e k , 



k=l i=l 

we may introduce the set of indices 

X = {/c : 3i, 1 < ? < p with Cjfc 7^ 0}, 
which is the complement of all coordinates for which ft(z,u) = f k (z) (i.e., all k such that the 



term In 2% appears in /£ are contained in K). When considering the system (20), we view it as 
evolving on the following state-space: 

X = X\ x • ■ ■ x X n 

where X^ = R if k ^ K and X k = (0, +00) if k G K. Observe that "the term lnzj appears 
anywhere in /* if and only if it appears in /*", so the largest open set where /* is well defined 
is indeed X . 

Proposition 3.3 Assume system (|l]), defined by (g) and (||), to be detectable. Then the 
system (|20| ) is K" -complete with input-value set W . 

Furthermore, pick any x E E + and let k be a positive constant. Define U K to be the subset 
of MP consisting of vectors u which satisfy |w| < k. Then, the system (p0|) is ISS with input-value 
set U K (with respect to x and the input u = H(x)). 

Proof. Note that, for each input u : [0, +00) — > W, /*(•, u) is of class C 1 and locally Lipschitz on 
X for each fixed u, and f*(z, ■) is measurable and locally integrable on X for each fixed z. Then, 
given an initial condition zq and a fixed input u as above, a unique maximal solution of ( ^0|) 
exists on a maximal interval, J. Observe that /* does not extend to z^ = and k £ K, but 
while Zk(t) € M™q no difficulties exist. Then J is not empty, and we will show that J = [0, +00). 

Step 1. To prove R" -invariance (with input-value set MP), we only need to show that, for 
each coordinate k with k ^ K, 

z k (t) > 0, V t € J, 

since all the others satisfy this condition by definition of Af. 

Assume that one of the coordinates is < at some instant and define 

to = inf{t G J : Zfc(t) = for some k ^ K}. 

Pick any coordinate k such that z k (to) = 0. 

Note that for every k ^ K, f£(z,u) = fk(z), so using the notation 

(z(t),x) = (zi(t), . . .,z k -x(t),x,z k+1 (t), . . .,z n (t)), 

we can construct a new function F : [0, 00) x M — ► M: 



F(t,x) 



/ fc (i(t),2;;u(t)), te[0,t ) 
f k (z(t ),x;u ), te[to, +00) 



(arbitrary tio in U re ). We have that standard existence and uniqueness results apply to x 
F(t,x). 

Claim. F(t,0) > for almost all t £ X. 
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To prove this, write 

m 

fk(z, X] u) = ^ ^ ^ ' Q-ij z \ ^ ■ ■ ■ Zf}_^ ' z k+l ' ■ • • Zn ' ^ki 
i=l jeA 
m 

+ EE ■ ■ ■ i-i l)j ^^r )j (hi - b kj ) 

i=l j&A+ 

where A = {j : b k j = 0} and A + = {j : b k j > 0}. 
For x = and t G X: 

(a) the second term is zero (since x = 0); 

(b) the first term is nonnegative since, by definition of to, Zi(t) > 0, for alH ^ k and i < to, 
and either F(t,0) = /fc(3(t), 0; u(t)) for t < t , or F(i,0) = fki.z(t ), 0; it(i )) for f > t . 

This proves the claim. 

Now, notice that the initial value problem 

x = F(t,x) 
x{0) = z k (0) 

has a solution x(t) in some open interval, J, which contains [0, to], and moreover, for all t < to, 
that solution coincides exactly with the trajectory of the /c-coordinate: x(t) = z k (t). Also, by 
continuity of both x(t) and z k (t), it must be that a; (to) = -^fe(to)- 

From Lemma [A.4| , with X = R and J xo = J it follows that x(t) > for all t £ J, and hence 
Zfc(t) > x(t) for all t < to, which contradicts the fact that -Zfc(to) = 0. 

To conclude, z k {t) > for all k ^ K and all t £ J, and thus the solution z(t) never leaves 
the set R> in the maximal interval J, proving R™ -invariance with input- value set W. 

Step 2. Now assume (to get a contradiction) that the interval J is finite, J = [0, t max ), 
tmax < +oo. From Lemma |3.2| , and from boundedness of u, we know that the derivative of 
V(z(t)) satisfies 

— V{z(t)) <\u- H(x)\ 2 <k 2 + \H{x)\ 2 = c, 

so 

v(z(t))<v(o)+ [ cdt < y(o) + ct max . 

J 

Since 1/ _1 ([0, V(0) + ct ma J) is compact, this inequality implies that z(t) remains bounded for 
all t £ J. (This does not yet establish that J = [0, +oo), since our state-space is X, not 
R ra ; trajectories could conceivably approach the boundary of the positive orthant for some 
coordinates in K.) 

Let b £ R be a (strictly) positive constant such that z(t) £ [0, b] n for all t £ J. Choose 
strictly positive constants e k for each k £ K such that: 

v v v 

ft ( z -> u ) = fk{z) + E °ik u i ~ E E CikCi i ln Z J ~ E c ^ ln Zfc > °' 

i=l i=l j^=k i=l 

whenever each takes values in [— ||u||, ||u||], z is a vector in (0, b] n and < 2e^. It is possible 
to make this choice of e k , because we are assuming C{j > for all i,j and we know that u is 
bounded and also that < Zj < b which imply the existence of positive constants a% such that: 

v p 
\fk(z)\ < ai, y^qfcl^l < a 2 , and ln ^' < a3- 

8=1 i=l j't^/c 
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Thus, it is enough to take e k such that 

1 

2_/i=l c ik 



ln(2e k ) < — —( a i + 0-2 + (13) 



(Note that Yli=i c lk ^ precisely because k € K.) If the value of e k thus obtained is greater 
than the initial condition z k (0), we should rechoose e k to get e k < 2fc(0). 

Now, each coordinate function z k (t), for k £ K and t G J satisfies an initial value problem 
of the form 

x = 2) 
x(0) = z fc (0) > 0, 

where 

F(t,x) := fk(zi(t), . . .,z k -i(t),x,z k+ i(t), . . . ,z n (t),u(t)) 
is locally Lipschitz on R>o for each fixed t and measurable on X for each fixed x, and moreover 



F(t, x) > whenever x < 2^^. Then Lemma |A.5| applies and we conclude that 



Z k (t) > E k , V t € J. 

Putting these together with the conclusion of step 1, we have that trajectories of ( p0| ) stay 
inside the set 

C = [£!,&] X ••• X [E n ,b]. 

(where £ k = if k £ K), during the maximal interval J. 

Since C is a compact subset of the domain of definition, X, and since f*(z,u) (when seen 
as an explicit function of time through u = u(t)), is defined for all t £ [0, +00) x X, it follows 
by standard ODE arguments (e.g., Proposition C.3.6 in |17]|) that the maximal interval is 
J = [0, +00) nl = [0, +00), contradicting the existence of a finite t max . Hence, J must be the 
nonnegative half-line, proving IR™ -completeness with input-value set U K . 

Step 3. Finally, notice that the function (^) is in fact an ISS-Lyapunov function with 
respect to the point x and the input H(x) for the system (p0|), because all three properties of 
Definition [2.4| are satisfied ((i) and (ii) we have previously shown and (iii) follows immediately 
from Lemma [T^). 

It follows from Lemma |2.7| that ( ]20j ) is ISS with input-value set U K (with respect to the 
point x and the input H(x)). I 



Remark 3.4 Proposition 3.3 showed that the given system is ISS with respect to bounded 
inputs, for each possible bound on inputs. It would be more interesting (although, not needed 
for our purposes) to know if the same system is ISS with respect to arbitrary inputs. That 
is, it might be possible to pick the comparison functions in the ISS definition in a manner 
independent of the bound k. It is remarked in |Q| that "input semiglobal" ISS (ISS with respect 
to bounded inputs) indeed implies ISS. However, the proof given in that paper is for the more 
standard notion of ISS, for systems evolving in M. n , while we have generalized the notion to deal 
with systems with positive states. Thus the proof does not apply. 
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3.1 Proof of Theorem £| 



Pick any initial states x(0) G M> ana - £ ^>o °f the original system (|l|) and the observer, 
respectively. We let w(-) = (#(•), z(-)) be the maximal trajectory of the composite system 

x = f(x) 

z = f(z) + C'(H(x)-H(z)), 

which we also write as w = g(w), with initial condition (x(0), z(0)). We need to show that 
w(t) = (x(t), z(t)) is defined for all t > 0, and \z(t) — x(t)\ — > as t — » +oo. 

The rest of the argument is completely analogous to what was done for the other observer. 
Since we know that x(t) is defined for all i > and converges to some equilibrium x as t — > +oo, 
we must prove that z(t) is also defined for all t > and converges to this same x as £ — ► +oo. 

Since x(i) converges to x, so does H(x(t)) converge to H{x). Let k be a constant such that 
\H(x(t))\ < k for all t. Let U K be the set of vectors u with norm less than or equal to k, so 
that H(x(t)) G U K for every t > 0. Then Proposition applied with = H(x(t)) and 
u = H(x), shows that the solution z(t) exists for all t > and that z(t) satisfies 

- s| < /3(|z(0) - x\,t) + <p(\\H(x) - H(x)\\). 

where j3 G ICC, and <p G /Coo. 

Now convergence is established in a routine ISS manner: given any e > 0, pick T\ such that 

tp{\\H(x)-H(x)\\i)<l 

where \\H(x) - H(x)\\i = ess. sup .{\H(x(t)) - H(x)\ : t > Ti}. Next, pick T 2 > T x such that 

P(\z(Ti) - x\,t) < |, Vt>T 2 . 

Then, for all t > \z{t) — x\ < e. Therefore, z(£) — > x as t — > +oo as wanted. 



3.2 Robustness of the Alternative Observer 

In Sections |2||3[ we have considered the general models with inputs 

z = f(z) + C'(u-h{z)) (21) 

and 

z = f(z) + C'(u-H(z)) (22) 

which (besides providing us with observers when u{t) = h(x(t)) or u(t) = H(x(t)) may be 
useful in questions regarding the control of the class of chemical reactions described in the 
Introduction. 



We have shown in Proposition ^3 that, as long as u remains in a set U K for all times, 
trajectories of (22) are well defined for all t and satisfy an input-to-state stability condition. 

The system (|2l|) also exhibits an ISS stability property, but only with respect to a more 
restricted set U, namely, ||u|| has to be uniformly bounded by a fixed constant depending on x 
which, in particular, requires every coordinate of u to be nonnegative. 

In fact, if it happens that u(t) becomes less than an (arbitrary) amount — e on some interval 
of time (in the case of a systematic negative disturbance or some negative offset in a control, 
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for instance due to leakage outflows in a chemical reactor), then solutions to (^T]) may blow 
up in finite time. This problem can be seen in the following simple example. Take the same 
network as in Example 3^ with n = 3, m = 2, T> =span{(l, 1, —1)'} and rate constants equal 
to 1, for simplicity. Let 

"200 
2 



C 



and consider an input u(t) = (u\(t) , U2(t))' with 2(ui(t) + U2(t)) < —e for some e > and all 
t. Then ([21]) becomes 

xi = -xix 2 + x 3 +2(m — xf) 

±2 = ~X\X 2 + X 3 

i3 = +XlX2 - X3 +2(« 2 -x§). 
Now look at the variable x\ + x 3 : 

xi+x 3 = 2(ui + u 2 ) - [x\ + x\) < -s - -(xi + x 3 ) 2 . 

Viewing this as a differential inequality, we may conclude that (xi + x 3 )(i) < w(t) where w(t) 
is the solution to w = — e — ^w 2 , w(0) = (x\ + X3)(0), i.e., 



(xi + x 3 )(t) < -y^tan (^-t - ao^j 



where ao = arctan y j > o. Therefore, for a suitably chosen initial condition, w(t) 

blows up to — oo as t tends to -^(ao + §). Hence, since (xi + x 3 )(t) < w(t), x\ + x 3 also has 
to blow up to — oo in finite time, and this can happen even if all states started positive. 

In contrast, model (^), allows each coordinate of u to have negative values, for an arbitrarily 
large interval of time, and only requires that \u(t)\ < k for all t, for some (fixed but arbitratry) 
positive constant, k. 



4 Generalization to Systems with "Multiple Linkage Classes" 

We now look at the more general case of a system with vector field of the form (J2j) but where the 
matrix A = diag(Ai, . . . , Ai) is block diagonal, and each A s (of size m s ) is itself irreducible and 
has nonnegative entries (or, at least, there exists a permutation matrix, P, such that PAP -1 
has that diagonal form). The matrix B is also partitioned into B = [B\ ■ ■ ■ Bl], where each B s 
is of dimension n x m s and, since the assumption that B has full rank is still valid, each B s itself 

has full rank, m s (mi H h mz = m). The system ([!]) can be written as x = f% (x) H h fz (x) 

where each f s {x) is computed according to formula @ using A s and B s . 

The number L is called the number of "linkage classes" and denotes the (smallest) number 
of connected components of the incidence graph G(A). G(A) is the graph whose nodes are the 
integers {1, . . . , m} and for which there is an edge j — ► i, iff Ojj > 0. 

To each connected component there corresponds a space 

T> s = span {bi — bj : 6j, bj are columns of B s }. 
The assumptions on the B s imply that each space T> s has dimension m s — 1, 

V = V 1 ®---®V L , (23) 
(direct sum), and so dimP = (m\ — !) + ••• + (m^ — 1) = m — L. 
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Example 4.1 To illustrate this structure, consider a general enzymatic mechanism with un- 
competitive inhibitor, consisting of one enzyme E, one substrate S, one product P and an 
uncompetitive inhibitor I (Q and R are intermediate complexes): 

S + E^Q, Q k -^S + E, 
Q^P + E, P + E^Q, 



Q + I^R, 



R^Q + I. 



There are two linkage classes, L = 2: 

(i) the first class consisting of the complexes S + E, P + E and Q; 

(ii) the second class consisting of the complexes Q + I and R. 

For class (i), S + E, P + E and Q are the three nodes of G(A\) and, due to the reversibility of 
the reactions, it is possible to "connect" any two of these nodes through a path in the graph. 
The same is true for class (ii). 

Notice that the same complex, e.g., Q in the above example, may belong to different con- 
nected components (otherwise the problem could be reduced to two completely independent 
"single linkage" problems). 



In [16] it is shown that this system does not admit boundary equilibria in any positive class. 
Let x = (S,P,Q, R, E, I)'. Then B = [B x B 2 ] and A = diag (A 1 ,A 2 ): 
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The space T> is given by T>\ + T> 2 ■ 






/c_ 





k 2 


_ h 


k- 2 


V 2 : 





A 2 



A:_ 3 
k 3 



V x = span {(1,-1,0, 0,0,0)', (1,0,-1, 0,1,0)')} 
V 2 = span {(0,0,1,-1,0,1)'} 



and the function / is given by f% + f 2 : 



k-iQ - hSE 
-k^ 2 PE + k 2 Q 
-k-iQ + hSE + k^ 2 PE - k 2 Q 


k-iQ - hSE - k- 2 PE + k 2 Q 




, h{x) 






k^R - k 3 QI 
-k^ 3 R + k 3 QI 


k_ 3 R-k 3 QI 



For a general "multiple linkage" system we may consider output maps of the same form 
as before, i.e., monomials in the state variables, as in (|3|). These monomials may include any 
of the variables "xj" , and they have the same interpretation as before: either representing the 
concentration of some of the substances (as in the case of x\), or being proportional to some 
reaction rate (as in the case of x\x^, etc.). 
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The necessary and sufficient detectability condition given in Theorem § is still valid (The- 
orem H generalizes, as sketched in The main fact to verify is that d5|) still holds for the 



general case. But, from [20], we know that for an interior point x, f{x) = if and only if 
f s (x) = for each s = 1, . . . , L, (That is, x is an equilibrium of the entire system if and only 
if it is an equilibrium of every system x = f s {x)\ this nontrivial fact follows from the block 
irreducibility property.) Then, for each s, (||) says that, if x G E + , then, for any z £ R™ 



>o> 



p(x) - p{z) G T>jr ^ zeE. 



s 

+ ' 



where E+ is the set of interior equilibria of x = f s {x) (so E + = E]_ n . . . Pi EY). Equivalently, 
if x E E + , then, for any z G R" , 

p{x) - p{z) G T>i n . . . H T>l = T>^ ^> z £ E + . 

Thus, Equivalence (||) holds for L > 1 as well. 

So, a general system x = fi(x) -\ \- /l(x), y = h(x) is detectable if and only if the matrix 

C of the (exponents of the) output map satisfies either condition (c) or (d) in Theorem |2[ Note 
that more "linkage classes" mean more information is needed in order for the system to be 
detectable. For a n-dimensional system, the space T> has dimension m — L and detectability 
implies that the matrix C must have rank p = n — (m — L). As we have seen, a single linkage 
class requires p = n — m + 1, whereas multiple linkage classes require p = n — m + L. 

In the example above, for detectability of x = fi(x) + f2(x), y = h(x), C will need to have 
rank 3. The following output would be a suitable choice: h{x) = (S 2 Q, RI 2 , E)' . 

For a detectable "multiple linkage" system, an observer for x = f\ (x) + ■ ■ ■ + /l(x) is again 
of the form 

z = h{z) + ■■■ + f L (z) + C'(h(x) - h{z)). (24) 



To prove convergence of the observer, one may use Proposition BJjL Lemma 2.8 and Corol- 



lary 2.11 as before, after checking some points. 

The Q-invariance argument is unchanged since the particular forms of A and B still imply 



that fllq) and the corresponding conclusions hold. 



To see that Lemma |2,8| holds, we must analyse the term 

VV(z) f(z) = VV(z) h{z) + --- + VV(z) f L (z). 
For each s = 1, . . . , L, result (|i~4|) from |Q holds, so 

VV(z)f s (z) = (p(z)-p(x)J s (z)) 

< -c s (x) ^2 (bi-bj,p(z) - p(x)) 2 

< -c a (x)\D Mz) - p{x))\\ 

where i,j H B s means that only the columns bi,bj of B s are present in the sum. Each matrix 
Z?o, s is such that its rows (when transposed) span the space T> s . Therefore, because of (23), the 
rows (transposed) of the matrix Dq = [D' 1 ■ ■ ■ D' Q L ]' span the whole of T> and hence 



W(z)f(z) < -Y,Cs(x)\D 0iS (p(z)-p(m 2 

s=l 

< -minc s (x) \D (p(z) - p{x))\ 2 . 
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The rest of the proof of Lemma 2J is unchanged, since the form of the observer is the same as 
before. Thus the function V satisfies an estimate 



WO) f*(z,u) < - ai (\D (p(z) - p(x))\) - a 2 (\C(p(z) - p(x))\) + <*|u - Kx)\ 2 

where a\(r) = minc s (x) r 2 and a 2 £ /Coo and the constant C3 are as in the Lemma. Finally, 
detectability guarantees that T> + imC" = IR n (Theorem [2]), so D = [D' C']' has rank n which 
implies W(z) f*(z, u) < — a(\z — x\) + 7(|u — h(x)\) for some functions a, 7 £ /Coo- 



Corollary 2.11 is still valid, as follows from the estimate W(z) f(z) < verified above. 



5 Some Simulations 



5.1 Robustness of the Observers 

To numerically test robustness of our observers, we carried out some simulations to explore 
their responses in two cases: existence of noise in the output measurements and unknown 
inputs acting in the system we wish to observe. 

As a working example we choose x = f(x) to be the network proposed by McKeithan in | 
and took the output to be h(x) - 



(xiX?,, X1X4) 



as in Example 2.1. The constants were taken 



to be: k = 0.5, ks = 3, k^ = 2, (3% = 1. 

In Figure |, the convergence of both observers for this example is shown. 
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Figure 1: The trajectories of the system and observers without noise and without unknown 
inputs. The errors, as well as the trajectories of the first coordinate, are shown against time. 
The dotted line corresponds to the McKeithan network, the dashed line corresponds to our 
main observer, and the solid line to the alternative observer (where the logarithm of the output 
is used). 

In one simulation, white noise was added to the outputs, so that the equation for the main 
observer becomes 

z = f(z) + C'(h(x{t)) + n(t) - h(z)) 
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and n(t) is an M 2 -valued vector white noise. In view of the Section O, solutions of this system 
exist for all t > 0, provided that h(x(t)) +n(t) is nonnegative; and the observer should provide 
estimates close to the true state as long as the magnitude of n(t) is small. Thus we chose x(0) 
so that Xiit) > 2 and \n(t)\ < 2 for all t and all i. 

With output noise the alternative observer becomes 

z = f(z) + C'(p(h(x(t)) + n(t)) - p(h{z))) 

so the noise appears inside the logarithmic function. But under the conditions above we know 
that this system has a solution for all t (by Proposition |3.3j ). 

In Figure [2] we can see the trajectories of the first coordinates as a result of this simulation. 
The alternative observer, although slower in the convergence, has smoothed out the noise effect, 
whereas the main observer exhibits small magnitude perturbations. 

10.5^ 1 1 1 1 1 1 1 
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2 4 6 8 10 12 14 

Figure 2: The trajectories of the system and observers in the presence of observation noise. 

In another simulation, the model (|l|) was perturbed by a disturbance consisting of a periodic 
signal and two "delta" functions. The equation for the model is 

x = f(x) + d(t) 

where d(t) = (d\(t), 0, 0, 0)' and d\ is shown in Figure ||[ The function d\ was chosen so that 
(for the same initial condition x(0) as above) Xi{t) > for all i and all t. (The observers are still 
the ones for the nominal system, with no disturbance.) Note how our first observer catches- up 
after the "delta" disturbance, and also tracks (with a small lag) the limit cycle into which the 
observed system trajectories converge. 

In all simulations the initial conditions for system (|I|) were x(0) = (3, 2, 3, 20)' and for the 
observers we took z(0) = (5, 6, 10, 17)'. 

5.2 Comparison with Standard Observers 

In the chemical reactor literature, observers are typically constructed using an extended Kalman 
filter (EKF), or, less often, a Luenberger type observer (Lbg) for a linearized system. Neither 
of these approaches is guaranteed to work for nonlinear systems, but it is often the case that 
they perform adequately in specific examples, and hence their practical success. In the second 
part of this section, we compare the performance of our main observer with those of an EKF 
observer and of a Lbg observer. Our purpose in doing so is to illustrate that, even for very 
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t t 

Figure 3: The disturbance shown on the left was added to the system. On the right, its effect 
on coordinate 1 of the system and observers are plotted against time. 

simple examples, these standard techniqes can fail in a major way, while our observer is, as 
predicted by the theory, convergent to the right estimate. 

These two standard constructions both have the form 

z = f(z)+L(z)(h(x)-h(z)), 

where the gain L(z) is to be found in such a way that (at least locally) \x(t) — z(t)\ — > as 
t — »■ +oo. Let us briefly review these constructions. 

We consider the linearized dynamics of the error e = x — z, around the origin e = 0: 

e=[F(z(t))-L(z(t))H(z(t))]e, 

where 

F(z) = Df(e + z)\ e=0 and H(z) = Dh(e + z)\ e=0 

are the Jacobians of / and h evaluated at the point z. 

The gain L(z) for a (continuous) extended Kalman filter is given by 

L(z(t)) = P(t)H'(z(t))R-\ 

where P is a symmetric positive definite solution to the following Riccati differential equation: 

P = -PH'ET^HP + FP + PF' + Q, 

and R and Q are two positive definite cost matrices. 

A Luenberger type observer is obtained by finding a constant gain L such that the matrix 
F(x) — LH(x) is Hurwitz. A linearized error equation can also be written as: 

e = \F{x{t)) - LH(x(t))]e 

(note that the time-dependence of F and H is given in terms of a dependence on the trajectory 
of the system itself, instead of on the trajectory of the observer). It can be shown that, for 
initial conditions x(0) and z(0) sufficiently close to x, this error is asymptotically stable with 
respect to the origin. (Note that Luenberger observers, at least in their standard formulation, 
are not a reasonable choice for our example, since their design assumes the knowledge of the 
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equilibrium point around which we are observing. For multi-stable systems such as ours, it 
makes little sense to assume that this equilibrium is known - in fact, knowing this equilibrium 
amounts to solving the detectability problem. However, we can still study the behavior of 
a Luenberger observer, especially since we will show that it does not work even when this 
additional information is provided.) 

Here we take as working example the 2-dimensional system 
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where a is a positive real number and (in the notation used in the Introduction) 
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1 



and B 



3 2 

4 5 



The invariant classes, C a C K™ , are given by x\ + X2 = a and we choose a = a + 1, which has 
the (globally asymptotically stable with respect to the class) equilibrium x = (a, 1)'. 

We would expect both EKF and Lbg to converge provided that z(0) is in a sufficiently small 
neighborhood of x, but they may diverge when z(0) is far from x. 



o 

EKF 
Lbg 



0.5 



0.6 



Figure 4: Local convergence of the observers. The initial condition is z(0) = (4.4, 1.3)'. 
To choose the gain for Lbg we need to pick L = [l\ l%f such that the matrix 

F - LH 



„2 1 „a „3 ; r ,a 
— d — L\CL a — L\CL 

a? — ha 0, —a 3 — ha" 1 



is Hurwitz (that is, all its eigenvalues have negative real parts). For the simulations shown here 
let a = 4 and L = [-0.7 1.0]'. 

In Figure |||, it is seen that all observers converge locally, although the errors converge to 
zero in different time scales. 

For other initial conditions, away from x, both EKF and Lbg seem to diverge, and their 
performances are clearly inferior to that of our observer. In Figures ^ and ^ the behavior of the 
three observers is shown in several regions of the positive orthant. 

In all simulations the initial conditions for the system were x(0) = (0.3,4.7)' and the gain 
matrix for the Luenberger observer was L = [—0.7 1.0]'. For the Riccati differential equation 
the initial condition was -P(O) = 0.1/, and the cost matrices Q,R were constant and equal to 
/, / being the 2x2 identity matrix. 
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Figure 5: Both extended Kalman filter and Luenberger observer diverge for initial conditions 
on the higher part of the positive orthant of the plane. On the left plot z(0) = (3, 16)' and on 
the right plot z(0) = (19,6)'. 
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Figure 6: There are regions (close to the x-axis) where one of observer converges but the other 
diverges. On the left plot z(0) = (0.8,0.4)' and Lbg diverges. On the right plot z(0) = (6,0.1)' 
and EKF diverges. 



A Appendix 

A.l Upper and Lower Bounds for an ISS-Lyapunov Function 



For a general ISS-Lyapunov function defined according to Definition 2.4, one can show the 
existence of class /Coo bounds, as follows. Let 



and 



v\(r) = vai{V(z) : \z — x\ > r and z £ M> } 



^(f) = max{y(z) : \z — x\ < r and z G IK> }. 



The function v<i is continuous because V is, nondecreasing and satisfies ^(0) = 0. Without loss 
of generality, we can say that it is strictly increasing (by taking 1/2(1") + r). 
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The function u\ is finite, because for any r > 0, we may pick any x r with \x r — x\ = r and 
consider L := V(x r ). Then 

v\(r) = mxn.{V(z) : \z — x\ > r and z £ R> and V(z) < L} 

(the minimum over this set exists, because it is a compact). Thus v\ is also continuous and 
nondecreasing and z^i(0) = 0. Without loss of generality, v\ can be assumed to be strictly 
increasing. 

In the particular case of the function V defined in (12) the functions v<± may be taken as 
follows. Consider v : M>o — > M>o given by: 



v(a) 



(1 - o) m(l - a) + a, 0<a<l 
(1 + a)ln(l + a) - a + 2(1 - In 2), 1 < a < +oo. 



Then v is a /Coo function (note that 2(1 — In 2) is a positive quantity that guarantees continuity 
of v at a = 1). 

From (|l~2"l), recall that g(r) = r lnr + 1 — r for r > 0. Equivalently, 
g (l + a ) = (1 + a) ln(l + a) - a, for a > -1. 

Then 

nh +n) < S C 1 - - |a|) + |a|, |a| < 1 

y\ x ^ a )-\ (l + a)ln(l + a)-a + 2(l-ln2), 1 < a < +oo. 

So g(zi/xi) < v(\zi - Xi\/xi) and 

V (z) <£*v (£L-^y 

By using the fact that 

Zi Xi | \ / I Zj Xi 

< V 



mm Xj 



and also keeping in mind that v{ai) < v(y / af + • • • + a\ ), it follows that 
y(z) < n maxxj v(\z — x\/ minxj) = ^({z — x\). 

For the lower bound, consider 

w(a) = (1 + a) ln(l + a) — a, < a < +oo 

and notice that 

5 (1 + a) > 
So, g(zi/xi) > w(\zi - Xi\/xi) and 

V(z)>± XiW (\^-). 

i=i \ i / 

Using Lemma |A.3| repeatedly, we may conclude that V is lower bounded by a /Coo function. 



(1 + |a|)ln(l + \a\) - \a\, \a\ < 1 

(1 + a) ln(l + a) - a, 1 < a < +oo. 



34 



A. 2 Auxiliary Calculations 

The next lemmas state some simple inequalities which are used throughout the text. 



Lemma A.l Let v,w G] — l,+oo[. Then 

(ln(l + v)) 2 + (ln(l + w)) 2 > X - (ln(l + Vv 2 + w 2 ] 

Proof. First, note that (ln(l + v)) 2 > (ln(l — v)) 2 for all — 1 < v < by putting, 

g(v) = (ln(l + v)) 2 - (ln(l - v)) 2 = (ln(l + v) + ln(l - u))(La(l + «) - ln(l - v)). 

The second factor is clearly negative and the first factor is strictly increasing on the interval 
] — 1,0[ (by computing the derivative) and is when v = 0, therefore it must be also negative 
on this interval, implying that g(v) > as desired. 

Then it holds that 

(ln(l + v)) 2 > (ln(l + H)) 2 , Vu G] - 1, +oo[, 

so it is enough to prove the lemma for all v,w G [0, oof. Using the fact that x 2 is a convex 
function, it follows that 

1 11 

-(ln(l + v)) 2 + ln(l + w)) 2 ) > -(ln(l + v) + ln(l + w)) 2 = -(ln(l + v + w + vw)) 2 . (25) 

Moreover, since both v and w are positive: w + v > \Jv 2 + w 2 . The logarithm is an increasing 
function, so 

-(ln(l +v + w + vw)) 2 > -(ln(l + ^v 2 +w 2 )) 2 . (26) 
Putting ( |25| ) and (|^) together gives desired result. I 



Lemma A. 2 Let m > n and let D be an m x n matrix of rank n, and consider a fixed compact 
subset S of K>o- Then, there exists a class /Coo function a such that |L>(/)(x) — p(a))\ 2 > 
a{\x — a\) for all a G S and all x G K"o- 

Proof. Since D has maximal rank, it has a left pseudo-inverse, for example D# = (D'D^D'. 
Then, for any vector g G M. n , 

\g\ = \D#Dg\<\\D#\\\Dg\ |D e | > T^l^l- 



Also, if g = p(x) — p(a), 



a, 



g. = In^i = l n ( l + ) := \n{l+ Vl ). 



Then, applying Lemma A.l inductively on n we have 



max a,- 
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Now, clearly 

a( " ) = 2»-w H 1 + t)) 2 

with s an upper bound on the magnitudes of the coordinates of points of S, is of class /Coo, 
which proves the lemma. 

■ 

The following is easy to prove using 03(c) = min{ai(c/2), 02(0/2)}: 

Lemma A. 3 Assume that 0:1,02 are two /Coo functions. Let a and b be positive numbers. 
Then there exists another /Coo function, 03, such that 

«i(a) + a 2 (b) > az(\/ a 2 + b 2 ). 

for all a, b > 0. 



A. 3 Some Simple Facts Concerning Invariance 

Consider the scalar initial value problem 

x = F(t,x) (27) 
x(0) = x 

where the function F is assumed to have domain I x X, where X is an open subset of K and 
Z = [0, +00). Let F be locally Lipschitz in x and measurable in t, more precisely, 

(i) For each a G X there exists a real number r a and a locally integrable function a : R — > 
[0, +00) such that the ball of radius r a centered at a, B ra (a) C X and 

||F(t,x)-F(t,y)|| <a(*)||x-y|| 

for each t G 1 and 1,1/6 B ra (a). 

(ii) For each fixed a £ X, the function 5 : X — » <Y given by g(t) := F(t, a) is measurable. 

For each xo £ X let J = J Xo be the maximal interval of existence of solutions of ( |27| ) in 
forward time. This is an interval of the form [0, i max ) with < t max < +00. 

Using a standard comparison principle (as done for instance in |20[|), we have: 



Lemma A. 4 Consider system (27) with domain X = R and assume further that, 



x = F(t, 0) > Vt G X. 

Assume also that the initial condition is positive: xq > 0. Then x(t) > Vt G J, i.e., the 
solution of ( p7|) remains positive for all times in J. 



Lemma A. 5 Consider again the scalar initial value problem (|27|), but now with domain <Y = 
(0, +00). Assume that the initial condition is positive: xq > 0, and that the following property 
holds, for some < e < xo: 

Vt G R, F[t, x)>0 whenever x < 2e. (28) 

Then for all t G J, the solutions of (|27|) satisfy x(t) > e. 
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Proof. Suppose that the result is false, and let 

s = inf{t : x(t) < e}. 

We may assume that s < oo and s S J, because otherwise the desired result is trivial. In other 
words, let us suppose that the solution x(t) of ( |27| ) hits the value e at some finite instant s in 
J. 

Continuity of solutions gives 

35 = 5(e) such that \x(t)-e\<e/2 Vi G (s - 5,s + 6). 

By property fl28|), F(t, x(t)) > Vi G (s— 5, implying that a;(i) is strictly increasing on 
this J-interval and so x(s) > x(s — 5). But, then x(s — 5) < e, which contradicts the definition 
of s . I 



References 

[1] Angeli, D., E.D. Sontag, and Y. Wang, "Further equivalences and semiglobal versions of 
integral input to state stability," Dynamics and Control 10(2000): 127-149. 

[2] Bastin, G., and J.F. van Impe, "Nonlinear and adaptive control in biotechnology: A tuto- 
rial," European J. Control 1(1995): 1-37. 

[3] Bastin, G., and D. Dochain, On-line Estimation and Adaptive Control of Bioreactors, 
Elsevier, Amsterdam, 1990. 

[4] Bernstein, D., and S.J. Bhat, "Nonnegativity, reducibility, and semistability of mass action 
kinetics," Proc. 1999 IEEE Conference on Decision and Control, IEEE Publications, Dec. 
1999, pp. 2206-2211. 

[5] Dochain, D., E. Buyl, and G. Bastin, "Experimental validation of a methodology for on-line 
state estimation in bioreactors," in Computer Applications in Fermentation Technology: 
Modelling and Control of Biotechnological Processes (N.M. Fish, R.I. Fox, and N.F. Thorn- 
hill, eds.), Elsevier, Amsterdam, 1988, pp. 187-194. 

[6] Feinberg, M., "Chemical reaction network structure and the stabiliy of complex isothermal 
reactors - 1. The deficiency zero and deficiency one theorems," Review Article 25, Chemical 
Engr. Sci. 42(1987): 2229-2268. 

[7] Feinberg, M., "The existence and uniqueness of steady states for a class of chemical reaction 
networks," Archive for Rational Mechanics and Analysis 132(1995): 311-370. 

[8] Feinberg, M., "Lectures on Chemical Reaction Networks," 4.5 out of 9 lectures delivered 
at the Mathematics Research Center, University of Wisconsin, Fall, 1979. 

[9] Feinberg, M., "Mathematical aspects of mass action kinetics," in Chemical Reactor Theory: 
A Review (L. Lapidus and N. Amundson, eds.), Prentice-Hall, Englewood Cliffs, 1977. 

[10] Hale, J.K., Ordinary Differential Equations, Wiley, New York, 1980. 

[11] Horn, F.J.M., and Jackson, R., "General mass action kinetics," Arch. Rational Mech. Anal. 
49(1972): 81-116. 



37 



[12] Horn, F.J.M., "The dynamics of open reaction systems," in Mathematical aspects of chem- 
ical and biochemical problems and quantum chemistry (Proc. SIAM-AMS Sympos. Appl. 
Math., New York, 1974), PP- 125-137. SIAM-AMS Proceedings, Vol. VIII, Amer. Math. 
Soc, Providence, 1974. 

Krichman, M., E.D. Sontag, and Y. Wang, "Input-output-to-state stability," SI AM J. 
Control and Optimization, to appear. 

McKeithan, T.W., "Kinetic proofreading in T-cell receptor signal transduction," Proc. 
Natl. Acad. Sci. USA 92(1995): 5042-5046. 

Pous, N.M., A. Rajab, A. Flaus, and J.M. Engasser, "Comparison of estimation methods 
for biotechnological processes," Chem. Engr. Sci. 43(1988): 1909-1914. 

Siegel, D., and MacLean, D., "Global Stability of Complex Balanced Mechanisms", 
preprint. 

Sontag, E.D., Mathematical Control Theory: Deterministic Finite Dimensional Sys- 
tems, Second Edition, Springer- Verlag, New York, 1998. 

Sontag, E.D., "Smooth Stabilization Implies Coprime Factorization," IEEE Transactions 
on Automatic Control, Vol.34, No.4 (1989), pp. 435-443. 

Sontag, E.D., "Remarks on stabilization and input-to-state stability," Proc. IEEE Conf. 
Decision and Control, Tampa, Dec. 1989, IEEE Publications, Dec. 1989, pp. 1376-1378. 

Sontag, E.D., "Structure and stability of certain chemical networks and applications 
to the kinetic proofreading model of T-cell receptor signal transduction," IEEE Trans. 
Autom. Control, to appear. (Preprints available as math.DS /9912237| (1999), revised as 
math.DS/0002113| (2000) in Los Alamos Archive, http:arXiv.org.) 



Sontag, E.D. and Y. Wang, "On characterizations of the input-to-state stability property," 
Systems and Control letters 24(1995), 351-359. 

Sontag, E.D. and Y. Wang, "On characterizations of input-to-state stability with respect 
to compact sets," in Proceedings of IF AC Non-Linear Control Systems Design Symposium, 
(NOLCOS '95), Tahoe City, CA, June 1995, pp. 226-231. 

Sontag, E.D., and Y. Wang, "Output-to-state stability and detectability of nonlinear sys- 
tems," Systems and Control Letters 29(1997): 279-290. 



38 



